Task Independent Speech Verification Using SB-MVE Trained Phone Models
نویسندگان
چکیده
Robust ASR-systems should benefit from detecting when portions of the decoded hypotheses are incorrect. This can be done by including a separate verification module based on statistical hypothesis testing. String based minimum verification error (SB-MVE) training is a promising alternative for improving the corresponding verification-models. This paper adresses a variant of SB-MVE at the phone level for design of task independent verification modules. The algorithm updates both H0 and H1 phone models. Experiments are performed on ”time of day” recordings of the Norwegian part of Speechdat (II). The results show a relative decrease in utterance error rate (compared to no verification) from 8 − 37% for false rejection rates ranging from 0 − 25%. Thus the method shows robustness with respect to choice of treshold.
منابع مشابه
Subword-based minimum verification error (SB-MVE) training for task independent utterance verification
In this paper we formulate a training framework and present a method for task independent utterance verification. Verification-specific HMMs are defined and discriminatively trained using minimum verification error training. Task independence is accomplished by performing the verification on the subword level and training the verification models using a general phonetically balanced database th...
متن کاملSegment-based phonetic class detection using minimum verification error (MVE) training
In this paper, we investigate the performance of segment-based detectors for three taxonomic sets of acoustic-phonetic classes. Acoustic-phonetic detectors form an important processing layer for speech event decoding in the new detection-based automatic speech recognition. In this study, detectors are trained within a minimum verification error (MVE) framework which is markedly different from t...
متن کاملComparing different model configurations for language identification using a phonotactic approach
In this paper different model configurations for language identification using a phonotactic approach are explored. Identification experiments were carried out on the 11-language telephone speech corpus OGI-TS, containing calls in French, English, German, Spanish, Japanese, Korean, Mandarin, Tamil, Farsi, Hindi, and Vietnamese. Phone sequences output by one or multiple phone recognizers are res...
متن کاملA segmental approach to text-independent speaker verification
Current text-independent speaker veri cation systems are usually based on modeling globally the probability density function (PDF) of the speaker feature vectors. In this paper, segmental approaches to text-independent speaker veri cation are discussed. Unlike the schemes based on Large Vocabulary Continuous Speech Recognition (LVCSR) with previously trained phone models, our systems are based ...
متن کاملSpeaker verification using minimum verification error training
We propose a Minimum Verification Error (MVE) training scenario to design and adapt an HMM-based speaker verification system. By using the discriminative training paradigm, we show that customer and background models can be jointly estimated so that the expected number of verification errors (false accept and false reject) on the training corpus are minimized. An experimental evaluation of a fi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004